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Abstract 

In this article we introduce a three-parameter extension of the bivariate expo- 
nential-geometric (BEG) law (Kozubowski and Panorska, 2005). We refer to this 
new distribution as bivariate gamma-geometric (BGG) law. A bivariate random 
vector (X, N) follows BGG law if N has geometric distribution and X may be 
represented (in law) as a sum of N independent and identically distributed gamma 
variables, where these variables are independent of N . Statistical properties such as 
moment generation and characteristic functions, moments and variance-covariance 
matrix are provided. The marginal and conditional laws are also studied. We show 
that BBG distribution is infinitely divisible, just as BEG model is. Further, we 
provide alternative representations for the BGG distribution and show that it en- 
joys a geometric stability property. Maximum likelihood estimation and inference 
are discussed and a reparametrization is proposed in order to obtain orthogonal- 
ity of the parameters. We present an application to the real data set where our 
model provides a better fit than BEG model. Our bivariate distribution induces 
a bivariate Levy process with correlated gamma and negative binomial processes, 
which extends the bivariate Levy motion proposed by Kozubowski et al. (2008). 
The marginals of our Levy motion are mixture of gamma and negative binomial 
processes and we named it BMixGNB motion. Basic properties such as stochastic 
self-similarity and covariance matrix of the process are presented. The bivariate dis- 
tribution at fixed time of our BMixGNB process is also studied and some results are 
derived, including a discussion about maximum likelihood estimation and inference. 

Keywords: Bivariate gamma-geometric law; Characteristic function; Infinitely di- 
visible distribution; Maximum likelihood estimation; Orthogonal parameters; Levy 
process. 



1 Introduction 



Mixed univariate distributions have been introduced and studied in the last years by com- 
pounding continuous and discrete distributions. Marshall and Olkin (1997) introduced a 
class of distributions which can be obtained by minimum and maximum of independent 
and identically distributed (iid) continuous random variables (independent of the random 
sample size), where the sample size follows geometric distribution. 

Chahkandi and Ganjali (2009) introduced some lifetime distributions by compound- 
ing exponential and power series distributions; this models are called exponential power 
series (EPS) distributions. Recently, Morais and Barreto-Souza (2011) introduced a class 
of distributions obtained by mixing Weibull and power series distributions and studied 
several of its statistical properties. This class contains the EPS distributions and other 
lifetime models studied recently, for example, the Weibull-geometric distribution (Mar- 
shall and Olkin, 1997; Barreto-Souza et al., 2011). The reader is referred to introduction 
from Morais and Barreto-Souza's (2011) article for a brief literature review about some 
univariate distributions obtained by compounding. 

A mixed bivariate law with exponential and geometric marginals was introduced by 
Kozubowski and Panorska (2005), and named bivariate exponential-geometric (BEG) 
distribution. A bivariate random vector (X, N) follows BEG law if admits the stochastic 
representation: 



where the variable N follows geometric distribution and {Xj}^ is a sequence of iid 
exponential variables, independent of N. The BEG law is infinitely divisible and therefore 
leads a bivariate Levy process, in this case, with gamma and negative binomial marginal 
processes. This bivariate process, named BGNB motion, was introduced and studied by 
Kozubowski et al. (2008). 

Other multivariate distributions involving exponential and geometric distributions 
have been studied in the literature. Kozubowski and Panorska (2008) introduced and 
studied a bivariate distribution involving geometric maximum of exponential variables. 
A trivariate distribution involving geometric sums and maximum of exponentials variables 
was also recently introduced by Kozubowski et al. (2011). 

Our chief goal in this article is to introduce a three-parameter extension of the BEG 
law. We refer to this new three-parameter distribution as bivariate gamma-geometric 
(BGG) law. Further, we show that this extended distribution is infinitely divisible, and, 
therefore, it induces a bivariate Levy process which has the BGNB motion as particular 
case. The additional parameter controls the shape of the continuous part of our models. 

Our bivariate distribution may be applied in areas such as hydrology and finance. 
We here focus in finance applications and use the BGG law for modeling log-returns 
(the Xi's) corresponding to a daily exchange rate. More specifically, we are interested in 
modeling cumulative log-returns (the X) in growth periods of the exchange rates. In this 
case N represents the duration of the growth period, where the consecutive log-returns 
are positive. As mentioned by Kozubowski and Panorska (2005), the geometric sum 
represented by X in Q is very useful in several fields including water resources, climate 
research and finance. We refer the reader to the introduction from Kozubowski and 
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Panorska's (2005) article for a good discussion on practical situations where the random 
vectors with description ([!]) may be useful. 

The present article is organized as follows. In the Section 2 we introduce the bivariate 
gamma-geometric law and derive basic statistical properties, including a study of some 
properties of its marginal and conditional distributions. Further, we show that our pro- 
posed law is infinitely divisible. Estimation by maximum likelihood and inference for large 
sample are addressed in the Section 3, which also contains a proposed reparametrization 
of the model in order to obtain orthogonality of the parameter in the sense of Cox and 
Reid (1987). An application to a real data set is presented in the Section 4. The induced 
Levy process is approached in the Section 5 and some of its basic properties are shown. 
We include a study of the bivariate distribution of the process at fixed time and also 
discuss estimation of the parameters and inferential aspects. We close the article with 
concluding remarks in the Section 6. 



2 The law and basic properties 

The bivariate gamma-geometric (BGG) law is defined by the stochastic representation 
Q and assuming that {Xi}^ is a sequence of iid gamma variables independent of N and 
with probability density function given by g(x; a, 0) = f3 a /T(a)x a ~ 1 e~ l3x , for x > and 
a, (3 > 0; we denote X,- t ~ T(a,(3). As before, N is a geometric variable with probability 
mass function given by P(N = n) = p(l — p) n_1 , for n G N; denote iV ~ Geom(p). 
Clearly, the BGG law contains the BEG law as particular case, for the choice a — 1. The 
joint density function fx,N{'> •) of (X, N) is given by 

ana 

fxA*> n ) = ^^ a_1 e^>(l - P) n ~\ x>0, neN. (2) 
1 (an) 

Hence, it follows that the joint cumulative distribution function (cdf) of the BGG 
distribution can be expressed by 

P(X <x,N< n)=pJ2(l- P y- lT ^ 3a) 



3=1 W ' 

for x > and neN, where T x (a) = jQt a ~ 1 e~ t dt is the incomplete gamma function. 

We will denote (X, N) ~ BGG(/3, a,p). We now show that (pX,pN) A (aZ//3,Z) as 

p — > + , where '-4' denotes convergence in distribution and Z is a exponential vari- 
able with mean 1; for a = 1, we obtain the result given in the proposition 2.3 from 
Kozubowski and Panorska (2005). For this, we use the moment generation function 
of the BGG distribution, which is given in the Subsection 2.2. Hence, we have that 
E(e tpX+spN ) = <f(pt,ps), where <f(-,-) is given by Using L'Hopital's rule, one may 
check that E(e tpX+spN ) — > (1 — s — atf/3)^ 1 as p — > + , which is the moment generation 
function of (aZ//3, Z). 
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2.1 Marginal and conditional distributions 

The marginal density of X with respect to Lebesgue measure is an infinite mixture of 
gamma densities, which is given by 

f x (x) =f2P(N = n)g(x;na,(3) = V -^— ]P IM^M! x > . (3) 

n=l n=l 

Therefore, the BGG distribution has infinite mixture of gamma and geometric marginals. 
Some alternative expressions for the marginal density of X can be obtained. For example, 
for a = 1, we obtain the exponential density. Further, with help from WolfranQ for 

a = 1/2,2,3,4, we have that 

f x (x) = p/3 1 / 2 x- 1/2 e~ f3x {a(x)e a{x)2 (l + erf(a(x))) + tt" 1 / 2 }, 

pf3e~^ x I 

fx(x) = R sinh(/3a; a/1 - p), 

f x (x) = ^" le "% (^) 1/3 e~ a(a:)1/3/2 {e 3a W 1/3 / 2 - 2sin(l/6(3v / 3a(x) 1 / 3 + vr))}, 
3(1 p) 

and 

fx(x) = ^-^a(x) 1 / 4 {smh(a(x) 1 /4) _ sm^x) 1 ' 4 )}, 
2(1 -p) 

respectively, where a{x) = a(x; (3, a,p) = (1 — p)(f3x) a and erf(x) = 27r~ 1 / 2 f* e~ t2 ^ 2 dt is 
the error function. Figure [T] shows some plots of the marginal density of X for (3 = 1, 
p = 0.2, 0.8 and some values of a. 




Figure 1: Plots of the marginal density of X for (3 = 1, a = 0.5, 1,2,3,4, p = 0.2 (left) 
and p = 0.8 (right). 

We now obtain some conditional distributions which may be useful in goodness-of-fit 
analyses when the BGG distribution is assumed to model real data (see Section 4). Let 

1 http://www. wolframalpha.com/ 
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m < n be positive integers and x > 0. The conditional cdf of (X, X) given X < n is 



P(X < x,X < m|X < n) 



;i- P )» 



£(i-p) j 



r(ja) ' 



We have that P(X < a;|X < n) is given by the right side of the above expression with n 
replacing m. 

For < x < y and n e N, the conditional cdf of (X, N) given X < y is 



E~i(i-^'-%,0'«)/iW 



The conditional probability P(JV < n|X < y) is given by the right side of the above 
expression with y replacing x. 

From ([2]) and ([3]), we obtain that the conditional probability mass function of X given 
X = x is 



P(N = n\X = x) 



[(1 -p)((3x) a } n /T(an) 

£Si[(i-p)(/feO a ]Vr(ja) 



for n G N. If a is known, the above probability mass function belongs to the one- 
parameter power series class of distributions; for instance, see Noack (1950). In this case, 
the parameter would be (1 —p)(/3x) a . For a = 1, we obtain the Poisson distribution trun- 
cated at zero with parameter f3x(l —p), which agrees with formula (7) from Kozuboswki 
and Panorska (2005). For the choice a = 2, we have that 



P{N = n\X = x) 



2n-l 



(l-p) n - 1 / 2 (f3x) 
(2n - 1)! sinh(^xvT^p) ' 



where n G N. 



2.2 Moments 

The moment generation function (mgf) of the BGG law is 



<p(t, s) = E (e tx+sN ) = E [e sN E (e tx \N)] = E 
and then 

(p(t,s) 



P 



P-t 



N ' 



pe s (3 c 



(/3 -t) a - e s /3 a (l-p)' 



t < ft, s G 



(4) 



for t < ft{l — [(1 — p)e s ] 1 ' a }. The characteristic function may be obtained in a similar 
way and is given by 



$(M) 



pe is ft a 



(ft - it) a - e is ft a (l -p)' 



(5) 
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for t, s G R. With this, the product and marginal moments can be obtained by computing 
E(X m N k ) = d m+k (p(t,s)dt m ds k \ t , s=0 or E(X m N k ) = (-i) m+k d m+k $(t, s)dt m ds k \ t , s=0 . 
Hence, we obtain the following expression for the product moments of the random vector 
(X,N): 

gFy^^ rl!"^' («> 

v > pm B(an,m) v ; 

where B(a,b) = T(a)T(b)/T(a + b), for a,b > 0, is the beta function. In particular, we 
obtain that E(X) = E(N) = p~ l and the covariance matrix S of (X,N) is 

given by 



(l-p)q 2 | a 




__ (7) 

The correlation coefficient p between X and N is p = v/(l — p)/(l — p + p/a). Let 
p* = y/1 — p, that is, the correlation coefficient of a bivariate random vector following 
BEG law. For a < 1, we have p < p*, and for a > 1, it follows that p > p*. Figure [2] 
shows some plots of the correlation coefficient of the BGG law as a function of p for some 
values of a. 




p 



Figure 2: Plots of the correlation coefficient of the BGG law as a function of p for 



a = 0.1,0.5,1,1.5,3. 



From Q , we find that the marginal mgf of X is given by 

p(3 a 



<p(t) 



(0-t)*-p°(i- P y 



for t < - (1 - p) 1 /"}. The following expression for the rth moment of X can be 
obtained from above formula or ([6]): 



pT{r) (1 — p) 



E(X') = 



n— 1 



/3 r ^ S(an,r) 



We notice that the above expression is valid for any real r > 0. 
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2.3 Infinitely divisibility, geometric stability and representa- 
tions 



We now show that BGG law is infinitely divisible, just as BEG law is. Based on 
Kozubowski and Panorska (2005), we define the bivariate random vector 



/1+nT \ 



where the GVs are iid random variables following T(a/n, (3) distribution and independent 
of the random variable T, which follows negative binomial NB(r,p) distribution with the 
probability mass function 

P(T = k) = I ^^p r (l-p) k , fcGNU{0}, (8) 

where r = 1/n. The moment generation function of (R,v) is given by 
E(e tR+sv ) = E[e s /" +sT E(e^-" T ^|T^ 



( . 8 y /n E 



pe s (3 a 



0-t 



((3 - t) a - e s (3 a (l -p) 

which is valid for t < — [(1 — p)e s } 1 ' a } and sGR. In a similar way, we obtain that 
the characteristic function is given by 

{ ' \(p-itr-e*^(l-p) J ' K ' 

for t,sGR. With this, we have that E(e ltR+lsv ) = s) 1//n , where s) is the charac- 
teristic function of the BGG law given in ([5]). In words, we have that BGG distribution 
is infinitely divisible. 

The exponential, geometric and BEG distributions are closed under geometric sum- 
mation. We now show that our distribution also enjoys this geometric stability property. 
Let {(Xi, iVj)}^ be iid random vectors following BGG(f3,a,p) distribution independent 
of M, where M ~ Geom(g), with < q < 1. By using Q and the probability generation 
function of the geometric distribution, one may easily check that 

M 

J2{Xi,N { ) ~BGG((3, a, pq). 
i=i 

From the above result, we find another stochastic representation of the BGG law, which 
generalizes proposition (4.2) from Kozubowski and Panorska (2005): 

M 



(X,A0 = ^( X »^ 
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where {(X h Ni)}^ ~ BGG(/3, a,p/q), with < p < q < 1, and M is defined as 
before. In what follows, another representation of the BGG law is provided, by showing 
that it is a convolution of a bivariate distribution (with gamma and degenerate at 1 
marginals) and a compound Poisson distribution. Let {Zi}^ be a sequence of iid random 
variables following logarithmic distribution with probability mass function P(Zi = k) = 
(1 — p) k (\k)~ l , for fceN, where A = — log p. Define the random variable Q ~ Poisson(A), 
independent of the Zj's. Given the sequence {Zi}^, let G t ~ T(aZi,/3), for i e N, be 
a sequence of independent random variables and let G ~ T(a,(3) be independent of all 
previously defined variables. Then, we have that 



(X,N)±(G,l)+J2(G i ,Z i ). 



(10) 



i=i 



Taking a = 1 in (|10j), we obtain the proposition 4.3 from Kozubowski and Panorska 
(2005). To show that the above representation holds, we use the probability generation 
functions E (t Q ) = e x{t ~ l) (for t G R) and E (t z >) = log(l - (1 - p)t)/\ogp (for t < 
(1 -pY 1 ). With this, it follows that 



P 



for t < f3. Furthermore, for t < (3{1 — [(1 — p)e 8 \ '"}, we have that 



p-t 

P 
p-t 

sll/al 



E 



{[ E ( ( 



tGi+sZx 



)]' 



3 A{E(e iG i+ sZ i)-l} 



11' 



3 tGi+sZi 



e s p c 



(P-t)" 



log{l - (1 - p)e s p a /(P - t) a } 
\ogp 



By using the above result in (11), we obtain the representation (10). 



3 Estimation and inference 



Let (Xi,N\), (X n ,N n ) be a random sample from BGG(/3, a,p) distribution and 
9 = (P, a,p) T be the parameter vector. The log-likelihood function £ = £(6) is given by 



oc not log P N n + n\ogp — n(3X n + nlog(l — p)(N n — 1) 

n 

+ J2{aN l logX l - \ogT(aN t )} , 



(12) 
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where X n = Y^i=\Xi/n ari( ^ N n = Y17=i^/ n - The associated score function U(9) 
(d£/dp,d£/da,d£/dp) T to log-likelihood function (12) comes 



d£_ 
dp 



n 



AT \ r^O 

n " -X n ), —=nN n logP + Y t {NitogX i -N i *(a £ N i )} 



P 



da 



i=l 



S 



and 



di_n n{N n - 1) 
op p 1 — p 

where ty(x) = d\ogT{x)/dx. By solving the nonlinear system of equations U(Q) = 0, it 
follows that the maximum likelihood estimators (MLEs) of the parameters are obtained 
by 



P = a JT> P N 



r and V^*(aJVO-nJV n log(^) =y)JV' i logX < . (14) 

V " i=l \ X n J . =1 



Since MLE of a may not be found in closed-form, nonlinear optimization algorithms such 
as a Newton algorithm or a quasi-Newton algorithm are needed. 

We are now interested in constructing confidence intervals for the parameters. For 
this, the Fisher's information matrix is required. The information matrix J(9) is 

J(9) = I Kp a n aa I , (15) 



^pp 



with 



where ty'(x) = d^>(x)/dx. 



Standard large sample theory gives us that y/n(9 — 9) -4- N 3 (0, J 1 (9)) as n — > oo 



where J 1 (6 I ) is the inverse matrix of J (9) defined in (15). 

The asymptotic multivariate normal distribution of y/n{9 — 9) can be used to construct 
approximate confidence intervals and confidence regions for the parameters. Further, 
we can compute the maximum values of the unrestricted and restricted log-likelihoods 
to construct the likelihood ratio (LR) statistic for testing some sub-models of the BGG 
distribution. For example, we may use the LR statistic for testing the hypotheses H Q : a = 
1 versus H\\ a 1, which corresponds to test BEG distribution versus BGG distribution. 



3.1 A reparametrization 

We here propose a reparametrization of the bivariate gamma-geometric distribution and 
show its advantages over the previous one. Consider the reparametrization n = a/f3 and 
a and p as before. Define now the parameter vector 9* = (fi,a,p) T . Hence, the density 
(J2j) now becomes 

(<y I n) na 

f* XN {x, n) = ( \ x na - l e- ax/ »p{l - p) n ~\ x > 0, n E N. 
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We shall denote (X, N) ~ BGG(/i, a,p). Therefore if (X 1; Ni), . . . , (X n , N n ) is a random 
sample from BGG(/z, a,p) distribution, the log-likelihood function t = £(9*) is given by 



a \ — a — — 

oc na\og [ — ) N n + n\ogp — n—X n + nlogfl — p)(N n — 1) 



+ J2{aN l logX l -\ogT(aN l )} . 



(16) 



The score function associated to ([16]) is U*(9*) = (dt /dp, dt /da, dt/dp) T , where 

nNJog (-} +J2 N SogX l -^(aN l )} 



dt na fX 



dp, p \ p 



N r . 



dt_ 
da 



and dt /dp is given by (13). The MLE of p is given (as before) in (14), and the MLEs of 
p and a are obtained by 

X " ( N \ n 

p = ^ and Yl W(&V 4 ) ~ nN *n log [a^ = ^ Ni log X t . 



N„ 



8=1 



8=1 



As before nonlinear optimization algorithms are needed to find MLE of a. Under this 
reparametrization, Fisher's information matrix J* (8*) becomes 



J* 
















* 











K* 

^pp 



with 



— K* 
Utt 5>— ' act 

PP P Z P 



pj^f^-py^'u 

i=i 



a) 



ap 



and — ft'pp' 



The asymptotic distribution of ^Jn(9* —9*) is trivariate normal with null mean and covari- 
ance matrix J*' 1 (9*) = diag{l/fc* , t/k* aa , l/k pp }. We see that under this reparametriza- 
tion we have orthogonal parameters in the sense of Cox and Reid (1987); the information 
matrix is a diagonal matrix. With this, we obtain desirable properties such as asymptotic 
independence of the estimates of the parameters. The reader is referred to Cox and Reid 
(1987) for more details. 



4 Application 

Here, we show the usefulness of the bivariate gamma-geometric law applied to a real data 
set. We consider daily exchange rates between Brazilian real and U.K. pounds, quoted 
in Brazilian real, covering May 22, 2001 to December 31, 2009. With this, we obtain the 
daily log-returns, that is, the logarithms of the rates between two consecutive exchange 
rates. Figure [3] illustrates the daily exchange rates and the log-returns. 

We will jointly model the magnitude and duration of the consecutive positive log- 
returns by using BGG law. We call attention that the duration of the consecutive positive 
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Figure 3: Graphics of the daily exchange rates and log-returns. 

log-returns is the same that the duration of the growth periods of the exchange rates. 
The data set consists of 549 pairs pQ, iVj), where X^ and iVj are the magnitude and 
duration as described before, for i = 1, . . . , 549. We notice that this approach of looking 
jointly at the magnitude and duration of the consecutive positive log-returns was firstly 
proposed by Kozubowski and Panorska (2005) with the BEG model, which showed a 
good fit to another currencies considered. Suppose {(Xi, N^)}^ are iid random vectors 
following BGG(/i,a,p) distribution. We work with the reparametrization proposed in 
the Subsection 3.1. 

Table [T] presents a summary of the fit of our model, which contains maximum likeli- 
hood estimates of the parameters with their respective standard errors, and asymptotic 
confidence intervals at the 5% significance level. Note that the confidence interval of a 
does not contain the value 1. Then, for the Wald test, we reject the hypothesis that the 
data come from BEG distribution in favor of the BGG distribution, at the 5% signifi- 
cance level. We also perform likelihood ratio (LR) test and obtain that the LR statistic 
is equal to 5.666 with associated p-value 0.0173. Therefore, for any usual significance 
level (for example 5%), the likelihood ratio test rejects the hypothesis that the data come 
from BEG distribution in favor of the BGG distribution, so agreeing with Wald test's 
decision. The empirical and fitted correlation coefficients are equal to 0.6680 and 0.6775, 
respectively, therefore, we have a good agreement between them. 



Parameters 


Estimate 


Stand, error 


Inf. bound 


Sup. bound 




0.0082 


0.00026 


0.0076 


0.0087 


a 


0.8805 


0.04788 


0.7867 


0.9743 


P 


0.5093 


0.01523 


0.4794 


0.5391 



Table 1: Maximum likelihood estimates of the parameters, standard errors and bounds 
of the asymptotic confidence intervals at the 5% significance level. 

The BEG model was motived by an empirical observation that the magnitude of the 
consecutive positive log-returns followed the same type of distribution as the positive 
one-day log- returns (see Kozubowski and Panorska, 2005). Indeed, the marginal distri- 
bution of X in the BEG model is also exponential (with mean p _1 /i), just as the positive 
daily log-returns (with mean /i). This stability of the returns was observed earlier by 
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Kozubowski and Podgorski (2003), with the log-Laplace distribution. We notice that 
BGG distribution does not enjoy this stability property, since the marginal distribution 
of X is an infinite mixture of gamma distributions. We now show that the data set 
considered here does not present this stability. 

Denote the ith positive one-day log-returns by Di and define D* = p~ 1 D i . If the 
data was generated from a BEG(/x,p) distribution, then an empirical quantile-quantile 
plot between the X^s (y-axis) and the ZVs (x-axis) would be around the straight line 
y = p~ 1 x, for x > 0. Figure [4] presents this plot and we observe that a considerable part 
of the points are below of the straight line y = 1.9636a; (we replace p by its MLE p = 
0.5093). Therefore, the present data set seems to have been generated by a distribution 
that lacks the stability property discussed above. In order to confirm this, we test the 
hypothesis that the Xj's and -D*'s have the same distribution. In the BEG model, both 
have exponential distribution with mean /i. Since p converges in probability to p (as 
n — > oo), we perform the test with p replacing p. The Kolmogorov-Smirnov statistic 
and associated p-value are equal to 0.0603 and 0.0369, respectively. Therefore, using a 
significance level at 5%, we reject the hypothesis that the X^s and -D*'s have the same 
distribution. 




0.000 0.005 0.010 0.015 

Posilive one-day log-returns 



Figure 4: Empirical quantile-quantile plot between cumulative consecutive positive log- 
returns and positive one-day log-returns, with the straight line y = 1.9636a;. The range 
(x,y) G (0,0.015) x (0,0.030) covers 85% of the data set. 

Figure [5] presents the fitted marginal density (mixture of gamma densities) of the 
cumulative log-returns with the histogram of the data and the empirical and fitted survival 
functions. These plots show a good fit of the mixture of gamma distributions to the data. 
This is confirmed by the Kolmogorov-Smirnov (KS) test, which we use to measure the 
goodness-of-fit of the mixture of gamma distributions to the data. The KS statistic 
and its p-value are equal to 0.0482 and 0.1557, respectively. Therefore, using any usual 
significance level, we accept the hypothesis that the mixture of gamma distributions is 
adequate to fit the cumulative log-returns. 

Plots of the histogram, fitted gamma density and empirical and fitted survival func- 
tions for the daily positive log-returns are presented in the Figure |6j The good perfor- 
mance of the gamma distribution may be seen by these graphics. In the Table [2] we 
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Figure 5: Plot on the left shows the fitted mixture of gamma densities (density of X) with 
the histogram of the data. Plot on the right presents the empirical and fitted theoretical 
(mixture of gamma) survival functions. 




Empirical 

— Gamma 



0.00 0.01 0.02 0.03 0.04 0.05 0.06 0.00 0.01 0.02 0.03 0.04 0.05 

Daily positive log-returns Daily positive log-returns 

Figure 6: Picture on the left shows the histogram and fitted gamma density for the 
daily positive log-returns. Empirical survival and fitted gamma survival are shown in the 
picture on the right. 



show absolute frequency, relative frequency and fitted geometric model for the duration 
in days of the consecutive positive log-returns. From this, we observe that the geometric 
distribution fits well the data. This is confirmed by the Pearson's chi-squared (denoted 
by x 2 ) test, where our null hypothesis is that the duration follows geometric distribution. 
The x 2 statistic equals 42 (degrees of freedom equals 36) with associated p-value 0.2270, 
so we accept (using any usual significance level) that the growth period follows geometric 
distribution. We notice that geometric distribution has also worked quite well for mod- 
eling the duration of the growth periods of exchange rates as part of the BEG model in 
Kozubowski and Panorska (2005). 

So far our analysis has showed that the bivariate gamma-geometric distribution and 
its marginals provided a suitable fit to the data. We end our analysis verifying if the 
conditional distributions of the cumulative log-returns given the duration also provide 
good fits to the data. As mentioned before, the conditional distribution of X given 
N = n is T(na,a/ fj). Figure [7] shows plots of the fitted density and fitted survival 
function of the conditional distributions of X given N = 1,2,3. The histograms of the 
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N ->■ 


1 


2 


3 


4 


5 


6 


> 7 


Absolute frequency 


269 


136 


85 


34 


15 


6 


4 


Relative frequency 


0.48998 


0.24772 


0.15483 


0.06193 


0.02732 


0.01093 


0.00728 


Fitted model 


0.50928 


0.24991 


0.12264 


0.06018 


0.02953 


0.01449 


0.01396 



Table 2: Absolute and relative frequencies and fitted marginal probability mass function 
of N (duration in days of the growth periods). 






Figure 7: Plots of the fitted conditional density and survival functions of X given N = 1, 
N = 2 and N = 3. In the pictures of the density and survival functions, we also plot the 
histogram of the data and the empirical survival function, respectively. 

data and the empirical survival functions are also displayed. The corresponding graphics 
for the conditional distributions of X given N = 4, 5 are displayed in the Figure [8j These 
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Empirical 
Gamma 



0.00 0.02 0.04 0.06 0.08 0.10 0.00 0.02 0.04 0.06 0.08 

Four-day cumulative log-returns Four-day cumulative log-returns 




Figure 8: Plots of the fitted conditional density and survival functions of X given N = A 
and N = 5. In the pictures of the density and survival functions, we also plot the 
histogram of the data and the empirical survival function, respectively. 

graphics show a good performance of the gamma distribution to fit cumulative log-returns 
given the growth period (in days). We also use the Kolmogorov-Smirnov test to verify the 
goodness-of-fit these conditional distributions. In the table [3] we present the KS statistics 
and their associated p-values. In all cases considered, using any usual significance level, 
we accept the hypothesis that the data come from gamma distribution with parameters 
specified above. 



Given N — > 


one-day 


two-day 


three-day 


four-day 


five-day 


KS statistic 


0.0720 


0.0802 


0.1002 


0.1737 


0.2242 


p- value 


0.1229 


0.3452 


0.3377 


0.2287 


0.3809 



Table 3: Kolmogorov-Smirnov statistics and their associated p-values for the goodness- 
of-fit of the conditional distributions of the cumulative log-returns given the durations 
(one-day, two-day, three-day, four-day and five-day). 



5 The induced Levy process 

As seen before, the bivariate gamma-geometric distribution is infinitely divisible, there- 
fore, we have that ([91 is a characteristic function for any real r > 0. This characteristic 



15 



function is associated with the bivariate random vector 



(R(r),v(r))= [J2 X * + G,r + T 



,i=l 



where {X i \ ( ^ l are iid random variables following T(a, (3) distribution, G ~ Y{ra, (3), T is 
a discrete random variable with NB(r, p) distribution and all random variables involved 
are mutually independent. Hence, it follows that the BGG distribution induces a Levy 
process {(X(r), NB(r)), r > 0}, which has the following stochastic representation: 

r /nb(t) \ *\ 

{(X(r), N(r)), r > 0} = I f Xi + G ( r )' r + NB ( r ) j ' r - p ( 17 ) 

where the X^s are defined as before, {G(r), r > 0} is a gamma Levy process and 
{NB(r), r > 0} is a negative binomial Levy process, both with characteristic functions 
given by 



and 



1 ' \1 - (1 - p)e ts 1 



respectively. All random variables and processes involved in (17) are mutually indepen- 
dent. 

From the process defined in (17), we may obtain other related Levy motions by delet- 
ing r and/or G(r). Here, we focus on the Levy process given by (17) and by deleting r. 
In this case, we obtain the following stochastic representation for our process: 

{(X(r), NB(r)), r > 0} = {(G(r + NB(r)), NB(r)) , r > 0} . (18) 



Since both processes (the left and the right ones of the equality in distribution) in ( pL8[ ) 
are Levy, the above result follows by noting that for all fixed r, we have + 
G(r)|NB(r) = k ~ T(a(r + k), (3). One may also see that the above result follows from the 
stochastic self-similarity property discussed, for example, by Kozubowski and Podgorski 
(2007): a gamma Levy process subordinated to a negative binomial process with drift is 
again a gamma process. 

The characteristic function corresponding to the (18) is given by 

[ ' J - [C > \{/3- it)* - e-/3«(l -p) J ' 1 J) 

for i,sGl. With this, it easily follows that the characteristic function of the marginal 
process {X(r), r > 0} is 



E (e ltx{r) ) = { ^- V 
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Since the above characteristic function corresponds to a random variable whose density is 
an infinite mixture of gamma densities (see Subsection 5.1), we have that {X(r), r > 0} 
is an infinite mixture of gamma Levy process (with negative binomial weights). Then, 
we obtain that the marginal processes of {(X(r), NB(r)), r > 0} are infinite mixture of 
gamma and negative binomial processes. Therefore, we define that {(X(r), NB(r)), r > 



0} is a BMixGNB(/3, a,p) Levy process. We notice that, for the choice a = 1 in (18), we 
obtain the bivariate process with gamma and negative binomial marginals introduced by 
Kozubowski et al. (2008), named BGNB Levy motion. 

As noted by Kozubowski and Podgorski (2007), if {NB(r), r > 0} is a negative bi- 
nomial process, with parameter q £ (0,1), independent of another negative binomial 
process {NB(r), r > 0} with parameter p £ (0,1), then the changed time process 
{NB*(r), r > 0} = {NB(r + NB(r)), r > 0} is a negative binomial process with pa- 



rameter p* = pq/(l —p + pq). With this and (18), we have that the changed time process 



{(G(r + NB*(r)), NB(r + NB(r))), r > 0} is a BMixGNB(/3, a,p*) Levy process. 

In what follows, we derive basic properties of the bivariate distribution of the BMixGNB 
process for fixed r > and discuss estimation by maximum likelihood and inference for 
large sample. From now on, unless otherwise mentioned, we will consider r > fixed. 

5.1 Basic properties of the bivariate process for fixed r > 

For simplicity, we will denote (Y,M) = (X(r), NB(r)). From stochastic representation 



(18), it is easy to see that the joint density and distribution function of (Y,M) are 



and 



9rMv, n) = r lrMrf T + *|> H»y (20) 
n\T(r)T(a(r + n)) 



P(Y < y, M < „) = Jfc g(l - ri' ^ D,^ + »), 



for y > and n £ N U {0}. Making a = 1 in (20), we obtain the BGNB distribution 
(bivariate distribution with gamma and negative binomial marginals) as particular case. 
This model was introduced and studied by Kozubowski et al. (2008). We have that the 
marginal distribution of M is negative binomial with probability mass function given in 
Q . The marginal density of Y is given by 

oo 

9y(v) = ^P(M = n)g{y;a{r + n),P), y > 0, 

n=0 

where g(-; a, 0) is the density of a gamma variable as defined in the Section 2. Therefore, 
the above density is an infinite mixture of gamma densities (with negative binomial 
weigths). Since the marginal distributions of (Y,M) are infinite mixture of gamma and 
negative binomial distributions, we denote (Y, M) ~ BMixGNB(/3, a,p, r). Some plots of 
the marginal density of Y are displayed in the Figure |9j for (3 = 1 and some values of a, 
p and r. 
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Figure 9: Graphics of the marginal density of Y for f3 = 1, a = 0.5, 1, 2, 3,4, p = 0.2, 0.8 
and r = 0.7, 2. 



The conditional distribution of Y\M = k is gamma with parameters a(r + fc) and (3, 
while the conditional probability distribution function of M\Y = y is given by 

P(M = n|r = y) = , r r / W /I } ^ [(l-P)Wr/ £ WTr^Kl"?)^. 
nil + r)) / ~^ j'l + ^)) 

for n = 0, 1, . . ., which belongs to one-parameter power series distributions if a and r are 
known. In this case, the parameter is (1 — p){(3y) a - For positive integers m <n and real 
y > 0, it follows that 



P(Y < y, M < m\M < n) = £ E |^^ r^(a(r + j} ) / £ 3^(1 - P )' 
and for < a; < y and positive integer n 



P(Y<x,M<n\Y<y)- M 1 3)) 



^i=0 j!r(a(j+r)) 1 /3yv"v' 

The moments of a random vector (Y, M) following BMixGNB(/3, r) distribution 
may be obtained by E(Y n M k ) = (-i) n+k d n+k $*(t, s)/dt n ds k \ t , s=0 , where $*(t,s) is the 



characteristic function given in (19). It follows that the product moments are given by 



E{ yn Mk) = fm y m ^~ Pg^V + ij , (21) 

p n r(r) z — ' m'Sfafr + m), n) 



m=0 
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The covariance matrix of (Y, M) is given by rS, where £ is defined in ([7]). The correlation 
coefficient is given by p, which is defined in the Subsection 2.2. Further, an expression for 



the nth marginal moment of Y may be obtained by taking k — in (21 ). If {W 7 (r), r > 0} 
is a BMixGNB(/3, a,p) Levy motion, one may check that cov(W(t), W(s)) = min(£, s)S. 

The BMixGNB law may be represented by a convolution between a bivariate distribu- 
tion (with gamma and degenerate at marginals) and a compound Poisson distribution. 
Such a representation is given by 



(Y,M)±(G,0) + Y t (G i ,Z i 



i=l 



with all random variables above defined as in the formula (10), but here we define G ~ 
r(ar, /3) and A = — rlogp. We end this Subsection by noting that if {(Y;,Mj)}" =1 are 
independent random vectors with (Yj,Mj) ~ BMixGNB(/3, a,p, r^), then 



ii 

8=1 \ 1=1 



J2(Yi, ~ BMixGNB lp,a,p,J2 



One may easily check the above result by using characteristic function (19). 



5.2 Estimation and inference for the BMixGNB distribution 

Suppose (Yi, Mi), . . . , (Y n , M n ) is a random sample from BMixGNB(/3, a,p, r) distribu- 
tion. Here the parameter vector will be denoted by 9^ = ({3,a,p,r) T . The log-likelihood 
function, denoted by t\ is given by 

v oc n{ra log/3 — logr(r) + r \ogp] — n/3X n + n{log(l — p) + a\og/3}M n 

n n n 

+ ] °g r ( M * + r ) - log r ( a ( Mi + r )) + « S^ Mi + r ) lo s x " 

i=l i=l i=l 

where M n = YZ=i Mi/n. 

The associated score function W(9^) = {dft /d(3,dft /da,dft /dp,dft /dr) has its com- 
ponents given by 

^ = n(-(r + M n )-X n L 



9£t 
9a 



r + M n ) log/3 + + M 4 ){logX, - *(a(r + M,))}, 



i=l 

+ 



dp 1 — p p 

dr 



{\og(p/3 a ) - *(r)} + ^ {aflogX, - *(a(r + Mi))] + *(r + M,)} . 



n ^ 

i=i 



Hence, the maximum likelihood estimators of and p are respectively given by 

^ = £ + Mn and ?= _*_ (22) 
A n r + M n 
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while the maximum likelihood estimators of a and r are found by solving the nonlinear 
system of equations 

S^^J + Y^(?+Mi){logXi - *(S(f + Mi))} = 



and 



a In log ( a 



.r + M, 



X n 



n < Wit) - lo 



) +^{logX,-M/(a(f+M,))}| = 

n 

-£*(r + A*i). 



r + M r , 



i=i 



After some algebra, we obtain that Fisher's information matrix is 



/ iJpp K^ a 



Jt(0t 



4 







A 



K 1 
"or 

// ! /ft 

"-pp "'pr 

/ft /ft 
( I t ™pr TT 



with 



K 



K 



K , 



t 



/9V 

pT 



T 



/3r 



T 



j2(T+j) 2 (i- P yv(a(T+j)) 



3=0 
oo 



F(r + j) 



pp 



K. 



J=0 



J! 



pr 



P 2 (l -P)' 

1 

P 



*'( T ) + ffr E(! - p) V^teO" + i)) " *V + j)} 



r(r + j; 



(23) 



So we obtain that the asymptotic distribution of y/n(9^ — 9^) is trivariate normal with 
null mean and covariance matrix Jt-i(#t) ; where Jt- 1 ^.) j s the inverse of the information 
matrix Jt(-) defined above. The likelihood ratio, Wald and Score tests may be performed 
in order to test the hypotheses H : a = 1 versus Hi. a ^ 1, that is, to compare BGNB 
and BMixGNB fits. Further, we may test the BMixGNB model versus the BGG or 
BEG models, which corresponds to the null hypotheses H : r = 1 and H : a = r = 1, 
respectively. 

As made in the Subsection 4.2, we here propose the reparametrization \i = a/ ' (3. We 
now denote the parameter vector by 9* = (//, a,p, r) . With this, one may check that 
the MLEs of p and /i are given by (22 ) and /i = X n j (r + M n ). The MLEs of r and a are 
obtained by solving the nonlinear system of equations ( 23 ) and 

?+ Mr,\ T + M„ 



n(?+M n )Uog[a 



X r , 



x„ 



- \ + E( ?+ Mi){logXi - tf(a(r + Af<))} = 0. 
J i=i 
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Under this proposed reparametrization, the Fisher's information matrix becomes 



J*(0*) 























K ar 












K P T 














K pr 


K TT 


) 



where its elements are given by 



K 



ar 



* a 



K = 



P 



3l-^(r + j)V(a(r+j)) 



K r . 



ap 



3=0 



r(r + j) r 
j\ ap' 



-3l-pF(r +J )$'(a(r +j )) 



r(r)f 
t 



1, 



pp 



ft 



PP' 



ft 



j=0 

and 



pr pr 

We have that k 



Kl 



lia 



0, that is, fj, and a are orthogonal parameters in contrast with 
the parameters (3 and a considered previously, where Kp a ^ 0. Further, we have that 

^(9* — 6*) — > N A (0, J* _1 (^*)) as n — > oo, where the covariance matrix J* _1 (^*) is the 
inverse of the information matrix J* '(#*). 



6 Concluding remarks 

We introduced and studied the bivariate gamma-geometric (BGG) law, which extends 
the bivariate exponential-geometric (BEG) law proposed by Kozubowski and Panorska 
(2005). The marginals of our model are infinite mixture of gamma and geometric distribu- 
tions. Several results and properties were obtained such as joint density and survival func- 
tions, conditional distributions, moment generation and characteristic functions, product 
moments, covariance matrix, geometric stability and stochastic representations. 

We discussed estimation by maximum likelihood and inference for large sample. Fur- 
ther, a reparametrization was suggested in order to obtain orthogonality of the parame- 
ters. An application to exchange rates between Brazilian real and U.K. pounds, quoted 
in Brazilian real, was presented. There our aim was to model jointly the magnitude and 
duration of the consecutive positive log-returns. In that application, we showed that the 
BGG model and its marginal and conditional distributions fitted suitably the real data set 
considered. Further, we performed the likelihood ratio and Wald tests and both rejected 
(with significance level at 5%) the hypothesis that the data come from BEG distribution 
in favor of the BGG distribution. 

We showed that our bivariate law is infinitely divisible and, therefore, induces a Levy 
process, named BMixGNB Levy motion. We also derived some properties and results of 
this process, including a study of its distribution at fixed time. Our proposed Levy motion 
has infinite mixture of gamma and negative binomial marginal processes and generalizes 
the one proposed by Kozubowski et al. (2008), whose marginals are gamma and negative 
binomial processes. Estimation and inference for the parameters of the distribution of 
our process at fixed time were also discussed, including a reparametrization to obtain a 
partial orthogonality of the parameters. 
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